Temporal Envelope and Fine Structure Cues for Dysarthric Speech Detection Using CNNs
نویسندگان
چکیده
Deep learning-based techniques for automatic dysarthric speech detection have recently attracted interest in the research community. State-of-the-art typically learn neurotypical and discriminative representations by processing time-frequency input such as magnitude spectrum of short-time Fourier transform (STFT). Although these are expected to leverage perceptual cues, STFT do not necessarily convey aspects complex sounds. Inspired temporal mechanisms human auditory system, this paper we factor signals into product a slowly varying envelope rapidly fine structure. Separately exploiting different cues present (i.e., phonetic information, stress, voicing) structure pitch, vowel quality, breathiness), two learned through convolutional neural network used detection. Experimental results show that both yields considerably better performance than only envelope, structure, or representation.
منابع مشابه
Consonant identification using temporal fine structure and recovered envelope cues.
The contribution of recovered envelopes (RENVs) to the utilization of temporal-fine structure (TFS) speech cues was examined in normal-hearing listeners. Consonant identification experiments used speech stimuli processed to present TFS or RENV cues. Experiment 1 examined the effects of exposure and presentation order using 16-band TFS speech and 40-band RENV speech recovered from 16-band TFS sp...
متن کاملObjective quality evaluation of noise-suppressed speech: effects of temporal envelope and fine-structure cues
While temporal envelope and fine-structure cues are known to be good predictors for speech intelligibility, it is not clear how well they are correlated with subjective quality ratings, particularly those using noise-suppressed speech. The present work evaluated the performance of two objective measures (i.e., NCM and TFSS), which were originally developed with primarily envelope or fine-struct...
متن کاملThe Role of Temporal Fine Structure Cues in Speech Perception
In this thesis, the importance of temporal fine structure (TFS) in speech perception is investigated. It is well accepted that TFS is important for sound localization and pitch perception, while envelope (ENV) is primarily responsible for speech perception. Recently, a significant contribution of TFS in speech perception has been suggested. This was linked to the improved ability of normal-hear...
متن کاملDetection of speech landmarks using temporal cues
In order to improve the performance of speech recognizers, particularly in degraded environments, it may be bene cial to integrate use of temporal information. As literature has shown that human listeners are able to use temporal cues in speech recognition tasks, this study examines algorithms for extraction of temporal cues in a speech signal. The task under analysis is the location of landmar...
متن کاملThe role of recovered envelope cues in the identification of temporal-fine-structure speech for hearing-impaired listeners.
Narrowband speech can be separated into fast temporal cues [temporal fine structure (TFS)], and slow amplitude modulations (envelope). Speech processed to contain only TFS leads to envelope recovery through cochlear filtering, which has been suggested to account for TFS-speech intelligibility for normal-hearing listeners. Hearing-impaired listeners have deficits with TFS-speech identification, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Signal Processing Letters
سال: 2021
ISSN: ['1558-2361', '1070-9908']
DOI: https://doi.org/10.1109/lsp.2021.3108509